An Evaluation of Kernel Equating: Parallel Equating With Classical Methods in the SAT Subject Tests™ Program
Abstract
This study investigated kernel equating methods by comparing them with operational equatings for two tests in the SAT Subject Tests™ program. GENASYS (ETS, 2007) was used for all equating methods, and scaled-score kernel equating results were compared with Tucker, Levine observed score, chained linear, and chained equipercentile equating results. The kernel chained equatings using a large fixed bandwidth gave results nearly identical to those of the chained linear equatings. The kernel poststratification equatings using a large bandwidth differed from both the Tucker and the Levine observed score equating results, although those differences were small for most of the score range. Similarly, when kernel poststratification equatings and kernel chained equatings using the optimal bandwidth were compared with chained equipercentile equatings, the differences were small for most of the score range.
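The near-identity between large-bandwidth kernel equating and linear equating reported in the abstract can be illustrated with a minimal sketch. The code below is not the study's procedure and does not use GENASYS: it assumes a simple equivalent-groups setting rather than the anchor-test designs evaluated in the report, uses made-up binomial score distributions, and implements the Gaussian kernel continuization in its standard mean- and variance-preserving form. All function names and the bandwidth value are hypothetical choices for illustration.

```python
import math


def normal_cdf(z):
    """Standard normal CDF via the error function."""
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))


def moments(scores, probs):
    """Mean and variance of a discrete score distribution."""
    mu = sum(x * p for x, p in zip(scores, probs))
    var = sum(p * (x - mu) ** 2 for x, p in zip(scores, probs))
    return mu, var


def kernel_cdf(x, scores, probs, h):
    """Gaussian-kernel continuized CDF that keeps the discrete mean and variance."""
    mu, var = moments(scores, probs)
    a = math.sqrt(var / (var + h * h))  # shrinkage factor that preserves the variance
    return sum(
        p * normal_cdf((x - a * xj - (1.0 - a) * mu) / (a * h))
        for xj, p in zip(scores, probs)
    )


def kernel_quantile(u, scores, probs, h):
    """Invert the continuized CDF by bisection (the CDF is strictly increasing)."""
    mu, var = moments(scores, probs)
    sd = math.sqrt(var)
    lo, hi = mu - 40.0 * sd, mu + 40.0 * sd
    for _ in range(200):
        mid = 0.5 * (lo + hi)
        if kernel_cdf(mid, scores, probs, h) < u:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


def kernel_equate(x, x_scores, x_probs, y_scores, y_probs, h):
    """Map a form-X score to the form-Y scale: G_h^{-1}(F_h(x))."""
    u = kernel_cdf(x, x_scores, x_probs, h)
    return kernel_quantile(u, y_scores, y_probs, h)


def linear_equate(x, x_scores, x_probs, y_scores, y_probs):
    """Classical linear equating: match means and standard deviations."""
    mu_x, var_x = moments(x_scores, x_probs)
    mu_y, var_y = moments(y_scores, y_probs)
    return mu_y + math.sqrt(var_y / var_x) * (x - mu_x)


# Toy data (assumption): two 10-item forms with binomial score distributions.
scores = list(range(11))
x_probs = [math.comb(10, k) * 0.5 ** k * 0.5 ** (10 - k) for k in scores]
y_probs = [math.comb(10, k) * 0.6 ** k * 0.4 ** (10 - k) for k in scores]

LARGE_H = 1000.0  # hypothetical "large fixed bandwidth", far above the score SDs
max_gap = max(
    abs(
        kernel_equate(x, scores, x_probs, scores, y_probs, LARGE_H)
        - linear_equate(x, scores, x_probs, scores, y_probs)
    )
    for x in scores
)
print("largest kernel-vs-linear difference:", max_gap)
```

As the bandwidth grows, each continuized distribution approaches a normal distribution with the same mean and variance as the discrete scores, so the equating function approaches a straight line. A small bandwidth instead tracks the discrete distributions closely and behaves more like an equipercentile method.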
Similar resources
Contributions to Kernel Equating
Andersson, B. 2014. Contributions to Kernel Equating. Digital Comprehensive Summaries of Uppsala Dissertations from the Faculty of Social Sciences 106. 24 pp. Uppsala: Acta Universitatis Upsaliensis. ISBN 978-91-554-9089-8. The statistical practice of equating is needed when scores on different versions of the same standardized test are to be compared. This thesis constitutes four contributions...
Selection the best Method of Equating Using Anchor-Test Design in Item Response Theory
Statement of the problem. The equating process is used to compare the scores of two different tests on the same subject. The goal of this research is to find the best method of equating data using the logistic model. Method. We use data from the Ph.D. test in the Statistics major for two consecutive years, 92 and 93. For the analysis, we specifically use the tests of the Statistics major ...
An Alternative Continuization Method to the Kernel Method in von Davier, Holland and Thayer’s (2004) Test Equating Framework
von Davier, Holland and Thayer (2004) laid out a five-step framework of test equating which can be applied to various data collection designs and equating methods. In the continuization step, they present an adjusted Gaussian kernel method which preserves the first two moments. This paper proposes an alternative continuization method which directly uses the log-linear function from the smoothin...
IRT Observed-Score Kernel Equating with the R Package kequate
The R package kequate enables observed-score equating using the kernel method of test equating. We present the recent developments of kequate, which provide additional support for item-response theory observed score equating using 2-PL and 3-PL models in the equivalent groups design and non-equivalent groups with anchor test design using chain equating. The implementation also allows for local ...
A comparison of Van der Linden's conditional equipercentile equating method with other equating methods under the random groups design
To ensure test security and fairness, alternative forms of the same test are administered in practice. However, alternative forms of the same test generally do not have the same test difficulty level, even though alternative test forms are designed to be as parallel as possible. Equating adjusts for differences in difficulties among forms of the test. Six traditional equating methods are consid...